Translating without in-domain corpus: Machine translation post-editing with online learning techniques

نویسندگان

  • Antonio L. Lagarda
  • Daniel Ortiz-Martínez
  • Vicente Alabau
  • Francisco Casacuberta
چکیده

Translating without in-domain corpus: Machine translation post-editing with online learning techniques Antonio L. Lagarda a,∗, Daniel Ortiz-Martínez b,1, Vicent Alabau b,1, Francisco Casacuberta b,1 a Institut Tecnològic d’Informàtica, Universitat Politècnica de València, Camí de Vera s/n, 46022 València, Spain b PRHLT Research Center, Universitat Politècnica de València, Camí de Vera s/n, 46022 València, Spain

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Online Learning for Neural Machine Translation Post-editing

Neural machine translation has meant a revolution of the field. Nevertheless, postediting the outputs of the system is mandatory for tasks requiring high translation quality. Post-editing offers a unique opportunity for improving neural machine translation systems, using online learning techniques and treating the post-edited translations as new, fresh training data. We review classical learnin...

متن کامل

Online Learning for Effort Reduction in Interactive Neural Machine Translation

Neural machine translation systems require large amounts of training data and resources. Even with this, the quality of the translations may be insufficient for some users or domains. In such cases, the output of the system must be revised by a human agent. This can be done in a post-editing stage or following an interactive machine translation protocol. We explore the incremental update of neu...

متن کامل

Machine Translation for Human Translators

While machine translation is sometimes sufficient for conveying information across language barriers, many scenarios still require precise human-quality translation that MT is currently unable to deliver. Governments and international organizations such as the United Nations require accurate translations of content dealing with complex geopolitical issues. Community-driven projects such as Wiki...

متن کامل

The ACCEPT Portal: An Online Framework for the Pre-editing and Post-editing of User-Generated Content

With the development of Web 2.0, a lot of content is nowadays generated online by users. Due to its characteristics (e.g., use of jargon and abbreviations, typos, grammatical and style errors), the user-generated content poses specific challenges to machine translation. This paper presents an online platform devoted to the pre-editing of user-generated content and its post-editing, two main typ...

متن کامل

Questing for Quality Estimation A User Study

Post-Editing of Machine Translation (MT) has become a reality in professional translation workflows. In order to optimize the management of projects that use post-editing and avoid underpayments and mistrust from professional translators, effective tools to assess the quality of Machine Translation (MT) systems need to be put in place. One field of study that could address this problem is Machi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Computer Speech & Language

دوره 32  شماره 

صفحات  -

تاریخ انتشار 2015